Principal Clusters Analysis: Analyzing Web Navigation Using a Multivariate Technique

نویسندگان

  • Harris Wu
  • Michael D. Gordon
  • Kurt DeMaagd
  • Weiguo Fan
  • Huijiang Wu
چکیده

We present a new statistical approach, called principal clusters analysis, for analyzing millions of user navigations among Web documents. This technique can identify distinct clusters of related information on a given topic. In addition, it can determine which information items within a cluster are useful starting points to explore the topic of the cluster, as well as key documents within the cluster to explore the topic in greater detail. This technique should prove promising in addressing information overload and other knowledge management issues.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining web navigations for intelligence

The Internet is one of the fastest growing areas of intelligence gathering. We present a statistical approach, called principal clusters analysis, for analyzing millions of user navigations on the Web. This technique identifies prominent navigation clusters on different topics. Furthermore, it can determine information items that are useful starting points to explore a topic, as well as key doc...

متن کامل

designing and implementing a 3D indoor navigation web application

​During the recent years, the need arises for indoor navigation systems for guidance of a client in natural hazards and fire, due to the fact that human settlements have been complicating. This research paper aims to design and implement a visual indoor navigation web application. The designed system processes CityGML data model automatically and then, extracts semantic, topologic and geometric...

متن کامل

Monitoring and assessment of a eutrophicated coastal lake using multivariate approaches

Multivariate statistical techniques such as cluster analysis, multidimensional scaling and principal component analysis were applied to evaluate the temporal and spatial variations in water quality data set generated for two years (2008-2010) from six monitoring stations of Veli-Akkulam Lake and compared with a regional reference lake Vellayani of south India. Seasonal variations of 14 differen...

متن کامل

شناسایی ساختار محتوایی مطالعات علم اطلاعات و دانششناسی بر اساس واژگان و مفاهیم مقالات آن در پایگاه اطلاعاتی وب‌آوساینس (2009-2013)

This study aimed to identify and analyze the structure of “Knowledge and Information Science (KIS)” scientific articles using co-word analysis in the “Web of Science (WoS)” database. Methodology of this study was content analysis of articles. By co-word analysis of the articles, subjects and concepts of KIS were identified, using Between-Groups Linkage algorithm in clustering techniques. The st...

متن کامل

Choosing the Best Hierarchical Clustering Technique Based on Principal Components Analysis for Suspended Sediment Load Estimation

1- INTRODUCTION The assessment of watershed sediment load is necessary for controling soil erosion and reducing the potential of sediment production. Different estimates of sediment amounts along with the lack of long-term measurements limits the accessibility to reliable data series of erosion rate and sediment yield. Therefore, the observed data of suspended sediment load could be used to ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002